Corpus: dan_news_2008_100K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 98 99 99 99 99
1000 838 961 991 992 993
10000 5629 8349 9396 9573 9620
100000 30794 65607 86628 92847 94593
1000000 30794 65607 86629 92848 94594


Zipf's diagram for sentence endings


Gnuplot diagram

5828 msec needed at 2018-02-07 07:27